Corrupt Data
Hard Samples, Bad Labels: Robust Loss Functions That Know When to Back Off
Nicholas Pellegrino, David Szczecina, Paul Fieguth
Incorrectly labelled training data are frustratingly ubiquitous in both benchmark and specially curated datasets. Such mislabelling adversely affects the performance and generalizability of models trained on the associated datasets through supervised learning. Frameworks for detecting label errors typically require well-trained, well-generalized models, yet most frameworks train those models on the corrupt data itself, which reduces model generalizability and, in turn, error-detection effectiveness unless a training scheme robust to label errors is employed. We evaluate two novel loss functions, Blurry Loss and Piecewise-zero Loss, that enhance robustness to label errors by de-weighting or disregarding difficult-to-classify samples, which are likely to be erroneous. These loss functions leverage the idea that mislabelled examples are typically more difficult to classify and should therefore contribute less to the learning signal. Comprehensive experiments on a variety of artificially corrupted datasets demonstrate that the proposed loss functions outperform state-of-the-art robust loss functions in nearly all cases, achieving superior F1 scores for error detection. Ablation studies further confirm the loss functions' broad applicability to both uniform and non-uniform corruption and to different label error detection frameworks. By using these robust loss functions, machine learning practitioners can more effectively identify, prune, or correct errors in their training data.
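The abstract does not give the exact formulations of Blurry Loss or Piecewise-zero Loss, so the PyTorch sketch below only illustrates the underlying idea it describes: de-weighting or disregarding difficult (high-loss) samples that are likely mislabelled. The function names and the loss_cutoff and temperature parameters are illustrative assumptions, not the paper's definitions.

# Illustrative sketch only: shows the general idea of down-weighting or
# discarding hard, likely-mislabelled samples; not the paper's exact losses.
import torch
import torch.nn.functional as F

def piecewise_zero_style_loss(logits, targets, loss_cutoff=2.0):
    """Cross-entropy that is zeroed for samples whose per-sample loss
    exceeds loss_cutoff (a hypothetical threshold); such hard samples
    are presumed mislabelled and contribute nothing to the gradient."""
    per_sample = F.cross_entropy(logits, targets, reduction="none")
    keep = (per_sample <= loss_cutoff).float()
    # Guard against a batch in which every sample is discarded.
    return (per_sample * keep).sum() / keep.sum().clamp(min=1.0)

def blurry_style_loss(logits, targets, temperature=1.0):
    """Soft variant: smoothly de-weights high-loss samples instead of
    discarding them outright (weights decay toward zero as loss grows)."""
    per_sample = F.cross_entropy(logits, targets, reduction="none")
    weights = torch.exp(-per_sample / temperature).detach()
    return (per_sample * weights).sum() / weights.sum().clamp(min=1e-8)

Either function can be dropped in wherever a standard reduction-by-mean cross-entropy would be used during training; samples the current model finds very hard simply stop driving the update, which is the "knowing when to back off" behaviour the title refers to.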
New tool lets artists fight AI image bots by hiding corrupt data in plain sight
From Hollywood strikes to digital portraits, AI's potential to steal creatives' work and how to stop it has dominated the tech conversation in 2023. The latest effort to protect artists and their creations is Nightshade, a tool allowing artists to add undetectable pixels into their work that could corrupt an AI's training data, the MIT Technology Review reports. University of Chicago professor Ben Zhao and his team created Nightshade, which is currently being peer reviewed, in an effort to put some of the power back in artists' hands. They tested it on recent Stable Diffusion models and an AI they personally built from scratch. Nightshade essentially works as a poison, altering how a machine-learning model produces content and what that finished product looks like.
Breaking Down the AI Revolution
With the many terms surrounding Artificial Intelligence (AI) and its business use cases today, it is hard to keep up with all of the new innovation across industries. As AI technology and techniques continue to evolve, so do the businesses that use them. Artificial Intelligence can be defined in one sentence as the science and engineering of making computers behave in ways that, until recently, we thought required human intelligence. As TowardsAI reports, in contrast to Machine Learning, AI is a moving target: its definition shifts as the related technologies mature. Machine Learning is one of the ways we expect to achieve AI.